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ENTERED 



SEQUENCE LISTING 
3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: SANCHIS, Vincent 

6 LERECLUS , Didier 

7 MENOU, Ghislaine 

8 LECADET, Marguerite-Marie 

9 MARTOURET, Daniel 
10 DEDONDER, Raymond 

12 (ii) TITLE OF INVENTION: NUCLEOTIDE SEQUENCES CODING FOR 

13 POLYPEPTIDES ENDOWED WITH A LARVICIDAL ACTIVITY TOWARDS 

14 LEPIDOPTERA 
16 (iii) NUMBER OF SEQUENCES: 2 

18 (iv) CORRESPONDENCE ADDRESS: 

19 (A) ADDRESSEE: BURNS , DOANE, SWECKER & MATHIS 

20 (B) STREET: P.O. Box 1404 

21 (C) CITY: Alexandria 

22 (D) STATE: Virginia 

23 (E) COUNTRY: USA 

24 (F) ZIP : 22313 

26 (v) COMPUTER READABLE FORM: 

27 (A) MEDIUM TYPE: Floppy disk 

28 (B) COMPUTER: IBM PC compatible 

29 (C) OPERATING SYSTEM: PC -DOS/MS -DOS 

30 (D) SOFTWARE: Patentln Release #1.0, Version #1.25 
32 (vi) CURRENT APPLICATION DATA: 

C--> 33 (A) APPLICATION NUMBER: US/09/918 , 4 85 

C--> 34 (B) FILING DATE: 25-Oct-2001 

35 (C) CLASSIFICATION: 

54 (vii) PRIOR APPLICATION DATA: 

39 (A) APPLICATION NUMBER: US/08/461,551 

40 (B) FILING DATE: 05-JUN-1995 

43 (A) APPLICATION NUMBER: US 08/251,652 

44 (B) FILING DATE: 31-MAY-1994 

47 (A) APPLICATION NUMBER: US 07/458,754 

48 (B) FILING DATE: ll-DEC-1989 

51 (A) APPLICATION NUMBER: EP 88 401 121.4 

52 (B) FILING DATE: 06-MAY-1988 

55 (A) APPLICATION NUMBER: FR 87 08090 

56 (B) FILING DATE: 10-JUN-1987 

58 (Viii) ATTORNEY/AGENT INFORMATION: 

59 (A) NAME: HUNTINGTON, R. D . 

60 (B) REGISTRATION NUMBER: 27,903 

61 (C) REFERENCE/DOCKET NUMBER: 010830-073 
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63 (ix) TELECOMMUNICATION INFORMATION: 

64 (A) TELEPHONE: (703) 836-6620 

65 (B) TELEFAX: (703) 836-2021 
68 (2) INFORMATION FOR SEQ ID NO: 1: 

70 (i) SEQUENCE CHARACTERISTICS: 

71 (A) LENGTH: 2711 base pairs 

72 (B) TYPE: nucleic acid 

73 (C) STRANDEDNESS: single 

74 (D) TOPOLOGY: linear 

76 (ii) MOLECULE TYPE: DNA (genomic) 

80 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 



82 AAGCTTCAAT AGAATCTCAA ATCTCGATGA CTGCTTAGTC TTTTTAATAC TGTCTACTTG 60 

84 ACAGGGGTAG GAACATAATC GGTCAATTTT AAATATGGGG CATATATTGA TATTTTATAA 120 

86 AATTTGTTAC GTTTTTTGTA TTTTTTCATA AGATGTGTCA TATGTATTAA ATCGTGGTAA 180 

88 TGAAAAACAG TATCAAACTA TCAGAACTTT GGTAGTTTAA TAAAAAAACG GAGGTATTTT 24 0 

90 ATGGAGGAAA ATAATCAAAA TCAATGCATA CCTTACAATT GTTTAAGTAA TCCTGAAGAA 300 

92 GTACTTTTGG ATGGAGAACG GATATCAACT GGTAATTACT CAATTGATAT TTCTCTGTCA 3 60 

94 CTTGTTCAGT TTCTGGTATC TAACTTTGTA CCAGGGGGAG GATTTTTAGT TGGATTAATA 4 20 

96 GATTTTGTAT GGGGAATAGT TGGCCCTTCT CAATGGGATG CATTTCTAGT ACAAATTGAA 4 80 

98 CAATTAATTA ATGAAAGAAT AGCTGAATTT GCTAGGAATG CTGCTATTGC TAATTTAGAA 540 
100 GGATTAGGAA ACAATTTCAA TATATATGTG GAAGCATTTA AAGAATGGGA AGAAGATCCT 600 
102 AATAATCCAG CAACCAGGAC CAGAGTAATT GATCGCTTTC GTATACTTGA TGGGCTACTT 660 
104 GAAAGGGACA TTCCTTCGTT TCGAATTTCT GGATTTGAAG TACCCCTTTT ATCCGTTTAT 720 
106 GCTCAAGCGG CCAATCTGCA TCTAGCTATA TTAAGAGATT CTGTAATTTT TGGAGAAAGA 780 
108 TTGGGATTGA CAACGATAAA TGTCAATGAA AACTATAATA GACTAATTAG GCATATTGAT 840 
110 GAATATGCTG ATCACTGTGC AAATACGTAT AATCGGGGAT TAAATAATTT ACCGAAATCT 900 
112 ACGTATCAAG ATTGGATAAC ATATAATCGA TTACGGAGAG ACTTAACATT GACTGTATTA 960 

114 GATATCGCCG CTTTCTTTCC AAACTATGAC AATAGGAGAT ATCCAATTCA GCCAGTTGGT 1020 

116 CAACTAACAA GGGAAGTTTA TACGGACCCA TTAATTAATT TTAATCCACA GTTACAGTCT 1080 

118 GTAGCTCAAT TACCTACTTT TAACGTTATG GAGAGCAGCG CAATTAGAAA TCCTCATTTA 1140 

120 TTTGATATAT TGAATAATCT TACAATCTTT ACGGATTGGT TTAGTGTTGG ACGCAATTTT 1200 

122 TATTGGGGAG GACATCGAGT AATATCTAGC CTTATAGGAG GTGGTAACAT AACATCTCCT 1260 

124 ATATATGGAA GAGAGGCGAA CCAGGAGCCT CCAAGATCCT TTACTTTTAA TGGACCGGTA 1320 

126 TTTAGGACTT TATCAATTCC TACTTTACGA TTATTACAGC AACCTTGCCA GCGCCACCAT 1380 

128 TTTAATTTAC GTGGTGGTGA AGGAGTAGAA TTTTCTACAC CTACAAATAG CTTTACGTAT 1440 

130 GCAGGAAGAG GTACGGTTGA TTCTTTAACT GAATTACCGC CTGAGGATAA TAGTGTGCCA 1500 

132 CCTCGCGAAG GATATAGTCA TCGTTTATGT CATGCAACTT TTGTTCAAAG ATCTGGAACA 1560 

134 CCTTTTTTAA CAACTGGTGT AGTATTTTCT TGGACGCATC GTAGTGCAAC TCTTACAAAT 1620 

13 6 ACAATTGATC CAGAGAGAAT TAATCAAATA CCTTTAGTGA AAGGATTTAG AGTTTGGGGG 1680 
138 GGCACCTCTG TCATTACAGG ACCAGGATTT ACAGGAGGGG ATATCCTTCG AAGAAATACC 1740 
140 TTTGGTGATT TTGTATCTCT ACAAGTCAAT ATTAATTCAC CAATTACCCA AAGATACCGT 1800 
142 TTAAGATTTC GTTACGCTTC CAGTAGGGAT GCAGCAGTTA TAGTATTAAC AGGAGCGGCA 1860 

14 4 TCCACAGGAG TGGGAGGCCA AGTTAGTGTA GATATGCCTC TTCAGAAAAC TATGGAAATA 1920 
146 GGGGAGAACT TAACATCTAG AACATTTAGA TATACCGATT TTAGTAATCC TTTTTCATTT 1980 
14 8 AGAGCTAATC CAGATATAAT TGGGATAAGT GAACAACCTC TATTTGGTGC AGGTTCTATT 2040 
150 AGTAGCGTTG AACTTTATAT AGATAAAATT GAAATTATTC TAGCAGATGC AACATTTGAA 2100 
152 GCAGAATCTG ATTTAGAAAG AGCACAAAAG GCGGTGAATG CCCTGTTTAC TTCTTCCAAT 2160 
154 CAAATCGGGT TAAAAACCGA TGTGACGGAT TATCATATTG ATCAAGTATC CAATTTAGTG 2220 
156 GATTGTTTAT CAGATGAATT TTGTCTGGAT GAAAAGCGAG AATTGTCCGA GAAAGTCAAA 2280 
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158 CATGCGAAGC GACTCAGTGA TGAGCGGAAT TTACTTCAAG ATCCAAACTT CAGAGGGATC 2340 

160 AATAGACAAC CAGACCGTGG CTGGAGAGGA AGTACAGATA TTACCATCCA AGGAGGAGAT 24 00 

162 GACGTATTCA AAGAGAATTA CGTCACACTA CCGGGTACCG TTGATGAGTG CTATCCAACG 2460 

164 TATTTATATC AGAAAATAGA TGAGTCGAAA TTAAAAGCTT ATACCCGTTA TGAATTAAGA 2 520 

166 GGGTATATCG AAGATAGTCA AGACTTAGAA ATCTATTTGA TCGCGTACAA TGCAAAACAC 2580 

168 GAAATAGTAA ATGTGCCAGG CACGGGTTCC TTATGGCCGC TTTCAGCCCA AAGTCCAATC 2640 

170 GGAAAGTGTG GAGAACCGAA TCGATGCGCG CCACACCTTG AATGGAATCC TGATCTAGAT 2 700 

172 TGTTCCTGCA G 2711 
174 (2) INFORMATION FOR SEQ ID NO : 2: 

176 (i) SEQUENCE CHARACTERISTICS: 

177 (A) LENGTH: 823 amino acids 

178 (B) TYPE: amino acid 

179 (C) STRANDEDNESS : unknown 

180 (D) TOPOLOGY: unknown 
182 (ii) MOLECULE TYPE: peptide 

186 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

188 Met Glu Glu Asn Asn Gin Asn Gin Cys lie Pro Tyr Asn Cys Leu Ser 

189 15 10 15 

192 Asn Pro Glu Glu Val Leu Leu Asp Gly Glu Arg lie Ser Thr Gly Asn 

193 20 25 30 

196 Ser Ser lie Asp lie Ser Leu Ser Leu Val Gin Phe Leu Val Ser Asn 

197 35 40 45 

200 Phe Val Pro Gly Gly Gly Phe Leu Val Gly Leu lie Asp Phe Val Trp 

201 50 55 60 

204 Gly lie Val Gly Pro Ser Gin Trp Asp Ala Phe Leu Val Gin lie Glu 

205 65 70 75 80 

208 Gin Leu lie Asn Glu Arg lie Ala Glu Phe Ala Arg Asn Ala Ala lie 

209 85 90 95 

212 Ala Asn Leu Glu Gly Leu Gly Asn Asn Phe Asn lie Tyr Val Glu Ala 

213 100 105 110 

216 Phe Lys Glu Trp Glu Glu Asp Pro Asn Asn Pro Ala Thr Arg Thr Arg 

217 115 120 125 

220 Val lie Asp Arg Phe Arg lie Leu Asp Gly Leu Leu Glu Arg Asp lie 

221 130 135 140 

224 Pro Ser Phe Arg lie Ser Gly Phe Glu Val Pro Leu Leu Ser Val Tyr 

225 145 150 155 160 

227 Ala Gin Ala Ala Asn Leu His Leu Ala lie Leu Arg Asp Ser Val lie 

228 165 170 175 

231 Phe Gly Glu Arg Trp Gly Leu Thr Thr lie Asn Val Asn Glu Asn Tyr 

232 180 185 190 

235 Asn Arg Leu lie Arg His lie Asp Glu Tyr Ala Asp His Cys Ala Asn 

236 195 200 205 

239 Thr Tyr Asn Arg Gly Leu Asn Asn Leu Pro Lys Ser Thr Tyr Gin Asp 

240 210 215 220 

24 3 Trp lie Thr Tyr Asn Arg Leu Arg Arg Asp Leu Thr Leu Thr Val Leu 

244 225 230 235 240 

24 6 Asp lie Ala Ala Phe Phe Pro Asn Tyr Asp Asn Arg Arg Tyr Pro lie 

247 245 250 255 

250 Gin Pro Val Gly Gin Leu Thr Arg Glu Val Tyr Thr Asp Pro Leu lie 
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